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We investigate uniformity properties of strategies. These properties involve sets of plays in order 
to express useful constraints on strategies that are not -calculus definable. Typically, we can state 
that a strategy is observation-based. We propose a formal language to specify uniformity proper- 
ties, interpreted over two-player turn-based arenas equipped with a binary relation between plays. 
This way, we capture e.g. games with winning conditions expressible in epistemic temporal logic, 
whose underlying equivalence relation between plays reflects the observational capabilities of agents 
(for example, synchronous perfect recall). Our framework naturally generalizes many other situa- 
tions from the literature. We establish that the problem of synthesizing strategies under uniformity 
constraints based on regular binary relations between plays is non-elementary complete. 

1 Introduction 

In extensive infinite duration games, the arena is represented as a graph whose vertices denote positions 
of players and whose paths denote plays. In this context, a strategy of a player is a mapping prescribing 
to this player which next position to select provided she has to make a choice at this current point of the 
play. As mathematical objects, strategies can be seen as infinite trees obtained by pruning the infinite 
unfolding of the arena according to the selection prescribed by this strategy; outcomes of a strategy are 
therefore the branches of the trees. 

Strategies of players are not arbitrary in general, since players aim at achieving some objectives. 
Infinite-duration game models have been intensively studied for their applications in computer science 
[3] and logic lil3l . First, infinite-duration games provide a natural abstraction of computing systems' 
non-terminating interaction [2] (think of a communication protocol between a printer and its users, or 
control systems). Second, infinite-duration games naturally occur as a tool to handle logical systems for 
the specification of non-terminating behaviors, such as for the propositional jU -calculus iflQl . leading to 
a powerful theory of automata, logics and infinite games |[T3l and to the development of algorithms for 
the automatic verification ("model-checking") and synthesis of hardware and software systems. In both 
cases, outcomes of strategies are submitted to &)-regular conditions representing some desirable property 
of a system. 

Additionally, the cross fertilization of multi-agent systems and distributed systems theories has led 
to equip logical systems with additional modalities, such as epistemic ones, to capture uncertainty lf27l 
EUdUEIlHOlCE]], an ^ more recently, these logical systems have been adapted to game models in order 
to reason about knowledge, time and strategies irTTl |T9l l9l. The whole picture then becomes intricate, 
mainly because time and knowledge are essentially orthogonal, yielding a complex theoretical universe 
to reason about. In order to understand to which extent knowledge and time are orthogonal, the angle of 
view where strategies are infinite trees is helpful: Time is about the vertical dimension of the trees as it 
relates to the ordering of encountered positions along plays (branches) and to the branching in the tree. 
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On the contrary, Knowledge is about the horizontal dimension, as it relates plays carrying, e.g., the same 
information. 

As far as we know, this horizontal dimension, although extensively studied when interpreted as 
knowledge or observation [@] [TTl [19] |8] [T] [9), has not been addressed in its generality. In this paper, 
we aim at providing a unified setting to handle it. We introduce the generic notion of uniformity proper- 
ties and associated so-called uniform strategies (those satisfying uniformity properties). Some notions of 
"uniform" strategies have already been used, e.g., in the setting of strategic logics ||29"1 151 [l9l and in the 
evaluation game of Dependence Logic [28), which both fall into the general framework we present here. 

We use a simple framework with two-player turn-based arenas and where information lies in posi- 
tions, but the approach can be extended to other settings. Additionally, although uniformity properties 
can be described in a set-theoretic framework, we propose the logical formalism RVJL which can be 
exploited to address fundamental automated techniques such as the verification of uniformity properties 
and the synthesis of uniform strategies - arbitrary uniformity properties are in general hopeless for au- 
tomation. The formalism we use combines the Linear-time Temporal Logic LTL [12] and a new modality 
R (for "for all related plays"), the semantics of which is given by a binary relation between plays. Modal- 
ity R generalizes the knowledge operator "K" of [15] for the epistemic relations of agents in Interpreted 
Systems. The semantic binary relations between plays are very little constrained: they are not neces- 
sarily equivalences, to capture, e.g. plausibility (pre)orders one finds in doxastic logic lTT6l . neither are 
they knowledge-based, to capture particular strategies in games where epistemic aspects are irrelevant. 
Formulas of the logic are interpreted over outcomes of a strategy. The R modality allows to universally 
quantify over all plays that are in relation with the current play. Distinguishing between the universal 
quantification over all plays in the game and the universal quantification over all the outcomes in the 
strategy tree yields two kinds of uniform strategies: the fully-uniform strategies and the strictly-uniform 
strategies. 

As extensively demonstrated in [22], uniform properties turn out to be many in the literature: they 
occur in games with imperfect information, in games with opacity conditions and more generally with 
epistemic conditions, as non-interference properties of computing systems, as diagnosability of discrete- 
event systems, in the game semantics of Dependence Logic. 

We investigate the automated synthesis of fully-uniform strategies, for the case of finite arenas and 
binary relations between plays that are rational in the sense of [6). Incidentally, all binary relations 
that are involved in the relevant literature seem to follow this restriction. In this context, two problems 
can be addressed: the fully-uniform strategy problem and the strictly-uniform strategy problem, which 
essentially can be formulated as "given a finite arena, a finite state transducer describing a binary relation 
between plays, and a formula expressing a uniformity property, does there exist a fully-uniform (resp. 
strictly-uniform) strategy for Player 1?". From [f22], the fully-uniform strategy problem is decidable but 
non-elementary - since then we have established that it is non-elementary hard. The algorithm involves 
an iterated non-trivial powerset construction from the arena and the finite state transducer which enables 
to eliminate innermost R modalities. Hence, the required number of iterations matches the maximum 
number of nested R modalities of the formula expressing the uniformity property. As expected, each 
powerset construction is computed in exponential time. This procedure amounts to solving an ultimate 
LTL game, for which a strategy can be synthesized [25] and traced back as a solution in the original 
problem. The decidability of the strictly-uniform strategy problem is an open question. 

The rest of the paper is organized in five sections. In Section [2l we present the standard material 
two-player turn-based arenas. We set up the framework and define uniform strategies in Section [3] and 
we illustrate the notion with two examples in Section |U Finally in Section [5j we give tight complexity 
bounds for the fully-uniform strategy problem, and we discuss future work in Section [6] 
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2 Preliminaries 

We consider two-player turn-based games that are played on graphs with vertices labelled with proposi- 
tions. These propositions represent the relevant information for the uniformity properties one wants to 
state. From now on and for the rest of the paper, we let AP be an infinite set of atomic propositions. 

An arena is a structure 5f = (V,E,vo,£) where V = V\ WV2 is the set of positions, partitioned between 
positions of Player 1 (Vi) and those of Player 2 (V2), E C (V\ x V2) U (V2 x V\ ) is the set of edges, vo G V 
is the initial position and £ : V — > &{AP) is a valuation function, mapping each position to the finite set 
of atomic propositions that hold in this position. Plays* and Plays m are, respectively, the set of finite and 
infinite plays. For an infinite play 71 = vovi . . . and i G N, := V; and n[0, i] := vq . . . v,-. For a finite play 
p =v v 1 ...v n ,last(p) = v n . 

A strategy for Player 1 is a partial function a : Plays* — > V that maps a finite play ending in V\ to the 
next position to play. Let a be a strategy for Player 1 . We say that a play n G Plays a is induced by G if 
for all i > such that n[i] G V\, n[i+ 1] = a(n[0,i]), and the outcome of a, noted Out(a) C Plays m , is 
the set of all infinite plays that are induced by a. Definitions are similar for Player 2's strategies. 

3 Uniform strategies 

We define the formal language RUL to specify uniformity properties. This language enables to express 
properties of the dynamics of plays, and resembles the Linear Temporal Logic (LTL) lfT2l . However, 
while LTL formulas are evaluated on individual plays (paths), we want here to express properties on 
"bundles" of plays. To this aim, we equip arenas with a binary relation between finite plays, and we 
enrich the logic with a modality R that quantifies over related plays, the intended meaning of "R(p holds 
in p" being "<p holds in every play related to p". 

The syntax of R LTL is similar to that of linear temporal logic with knowledge lilSl . However, we 
use R instead of the usual knowledge operator K to emphasize that it need not be interpreted in terms of 
knowledge in general, but merely as a way to state properties of bundles of plays. The syntax is: 

cp,Y ■■= p I ~<<p I <P A y I 0<p I <pUy/ I Rep peAP 

Consider an arena <S = (V,E,vq,£) and a rational relation ~> C Plays* x Plays*. A formula <p of 
KLTL is evaluated at some point i G N of an infinite play 71 G Plays m , within a universe IT C Plays a,. 
The semantics is given by induction over formulas. 

Tl,7i,i\=p if pe£(n[i]) U,7i,i \= -i<p if U,7V,i \fc (p 

U,7i,i ' \= (p A Y if n,7T,/ |= (p and U,n,i \= \p U,n,i\= Oq> if U,7l,i + l \= (p 
IT, 71, i \= (pU\j/ if there is j > i such that n, %, j \= y and for all i < k < j, II, K, k \= (p 
n, tz, i \= R(p if for all iz' G U,j G N such that n[0, i) %' [0, j), U,7t',j \= (p 

From this semantics, we derive two notions of uniform strategies, which differ only in the universe the 
R modality quantifies over: Out(a) or Plays a (with the latter, related plays not induced by the strategy 
also count). The motivation for these two definitions is clear from [22] where many examples from the 
literature are given. 

Definition 1 Let ^ be an arena, ~~>be a rational relation and (p be an RLTL formula. A strategy o is: 
(~>, (p)-strictly-uniform if for all % G Out(a), Out(o), 7t,0 \= (p, 
(~~>, (p) -fully-uniform if for all % G Out(a), Plays^^,® \= (p. 
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4 Concrete examples 

In this section we illustrate our notions of strictly and fully uniform strategies denned in the previous 
section with the examples of observation-based strategies in games with imperfect information, and 
games with opacity condition. 

4.1 Observation-based strategies 

Games with imperfect information, in general, are games in which some of the players do not know ex- 
actly what is the current position of the game. Poker is an example of imperfect-information game: one 
does not know which cards her opponents have in hands. One important aspect of imperfect-information 
games is that not every strategy is "playable". Indeed, a player cannot plan to play differently in sit- 
uations that she is unable to distinguish. This is why players are required to use strategies that select 
moves uniformly over observationally equivalent situations. This kind of strategies is sometimes called 
uniform strategies in the community of strategic logics dl29l l5l[l9l). or observation-based strategies in 
the community of computer-science oriented game theory ([8 ]). In fact, all the additional complexity of 
solving imperfect-information games, compared to perfect-information ones, lies in this constraint put 
on strategies. 

We show that the notion of observation-based strategy, and hence the essence of games with imper- 
fect information, can be easily embedded in our notion of uniform strategy. In two-player imperfect- 
information games as studied for example in 11261 181171. Player 1 only partially observes the positions of 
the game, such that some positions are indistinguishable to her, while Player 2 has perfect information 
(the asymmetry is due to the focus being on the existence of strategies for Player 1). Arenas are labelled 
directed graphs together with a finite set of actions Act, and in each round, if the position is a node v, 
Player 1 chooses an available action a, and Player 2 chooses a next position v' reachable from v through 
an a-labelled edge. 

We equivalently define this framework in a manner that fits our setting by putting Player 1 's actions 
inside the positions. We have two kinds of positions, of the form v and of the form (v,a). In a position v, 
when she chooses an action a, Player 1 actually moves to position (v,a), then Player 2 moves from (v,a) 
to some v'. So an imperfect-information game arena is a structure % mp = (Sf ,~) where Sf = (V,E,vo,£) 
is a two-player game arena with positions in V\ of the form v and positions in V2 of the form (v,a). 
We require that vE(v',a) implies v = v', and vo G V\. For a position (v,a) G Vz, we note (v,a).act := a. 
We assume that p\ G AP, and for every action a in Act, p a G AP. p\ holds in positions belonging to 
Player 1, and p a holds in positions of Player 2 where the last action chosen by Player 1 is a: £(v) = {p{\ 
for v G Vi, £{v,a) = {p a } for (y,a) G V 2 . Finally, ~ C Vf is an observational equivalence relation on 
positions, that relates positions indistinguishable for Player 1. We define its extension ~ to finite plays: 
vo(vo,ai)vi ...{v n -i,a n )v n ~ VQ{vo,a' x )v' l ...(v' n _ l ,d n )v' n if for all i > 0, v t ~ v\ and a t = a[. 

We add the classic requirement that the same actions must be available in indistinguishable positions: 
for all v, V G V\, if v ~ v' then vE(v,a) if, and only if, v'E(v',a). In other words, if Player 1 has different 
options, she can distinguish the positions. 

Definition 2 A strategy a for Player 1 is observation-based if for all p,p' G vq(V2Vi)*, p ~ p' implies 
(j(p).act = o(p').act. 

We define the formula 

SameAct := G(p]_ -s- \J ROp a ) 

aeAct 
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which, informally, expresses that whenever it is Player l's turn to play, there is an action a that is played 
in every equivalent finite play. 

Proposition 1 A strategy o for Player 1 is observation-based iff it is (~ ,SameAct)-strictly-uniform. 

Here we have to make use of the notion of strict uniformity, and not the full uniformity. Indeed, after 
a finite play n[0, i] ending in V\, we want to enforce that in all equivalent prefixes of infinite plays that 
conform to the strategy considered, Player 1 plays the same action. It would obviously make no sense to 
enforce the same on equivalent prefixes of every possible play in the game, which encompass all possible 
behaviours of Player 1. 

Notice that in order to embed the case of players with different memory abilities, e.g. imperfect- 
recall, one would just have to replace pa with the appropriate relation. 

For the moment we have not mentioned any winning condition. For a strategy, being (?»,SameAct)- 
strictly-uniform only characterizes that it is "playable" for a player with imperfect information, but it does 
not characterize the outcome of this strategy. However, if one considers a game with imperfect informa- 
tion in which the winning condition for Player 1 is an LTL formula (p, then the set of (pa, SameAct A <p)- 
strictly-uniform strategy is exactly the set of winning observation-based strategy. 

When talking about knowledge and strategic abilities, the question of objective vs subjective ability 
should be raised (see HH). The difference is basically whether a strategy is defined only on "concrete" 
plays, starting from the initial position, or if it has to be defined on all "plays" starting from any position 
the player confuses with the initial one. In the setting presented here, the initial position is part of the 
description of the arena, hence players are assumed to know it and all plays considered start from this 
position. But in order to model in this setting the case of Player 1 not knowing the initial position, one 
could add a fresh artificial initial position v' Q , from which no matter the action Player 1 chooses, Player 
2 can move to any position that Player 1 confuses with vq. Then, for a winning condition <p € LTL, the 
existence of an observation-based winning strategy for Player 1 from vo (resp. Vq) would denote objective 
(resp. subjective) ability to enforce (p. 

4.2 Games with opacity condition 

Games with opacity condition, studied in l23l . are based on two-player imperfect-information arenas, 
with Alice having perfect information as opposed to Bob who partially observes positions. In such 
games, some positions are "secret" as they reveal a critical information that Bob aims at discovering. We 
are interested in Alice's ability to prevent Bob from "knowing" the secret, in the epistemic sense. 

More formally, assume that a proposition p$ € AP represents the secret. Let = be an 

imperfect-information arena as described in Section 14.11 with a distinguished set of positions S C V\ 
that denotes the secret. Bob is Player 1 as he has imperfect information, and Alice is Player 2. Letting 
= (V,E,vi,£), we require that £~ l {{ps}) = S (positions labeled by p$ are exactly positions v € S). For 
a finite play p with last(p) € V\, Bob's information set or knowledge after p is I(p) := {last(p') \ p' € 
Playsz^p pa p'}. It is the set of all the positions he considers possible after observing p. An infinite play 
is winning for Bob if there exists a finite prefix p of this play whose information set is contained in S, 
i.e. I(p) C 5, otherwise Alice wins. It can easily be shown that: 

Proposition 2 A strategy o for Alice is winning if and only if o is (~,G^Rps)-fully-uniform. 

Here we are interested in Alice's strategies and Bob's knowledge. Since Bob only partially observes 
what Alice is playing, some plays that are not brought about by Alice's strategy are considered possible 
by Bob. Full uniformity is therefore the right notion to capture correctly Bob's knowledge. 
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Here again, to model different memory and observational abilities of Bob, one can use the appropriate 
binary relation, provided it is rational. Also, notice that though we chose to illustrate our framework with 
opacity aspects, any winning condition that is expressible by a formula of the epistemic linear temporal 
logic with one knowledge operator would fit in our setting. 

5 Synthesizing fully-uniform strategies 

In this section, we investigate the complexity of synthesizing a fully-uniform strategy. We first consider 
the associated decision problem, called here the fully-uniform strategy problem: given a uniform property 
(p G i?LTL, a finite arena Sf = (V,E,vq,£), and a finite state transducer T over alphabet V representing a 
rational binary relation between plays (see ||6]), does there exist a ([T], <p) -fully-uniform strategy in 
where [r] is the binary relation denoted by T . 

Definition 3 For a formula (p G RLTL, the J?-depth of (p, written dp((p), is the maximum number of 
nested R modalities in (p. For each k G N, we let RLTL^ := {<p G RLTL \ dj^{(p) = k}. 

Theorem 3 The fully-uniform strategy problem for formulas ranging over \J k<n RLTLk is «-Exptime- 
complete for n > 2, and 2EXPTlME-complete for n < 2. 

The proof for the upper bounds in Theorem [3] can be found in [22 ], in which we devise a decision 
procedure based on a powerset construction which simulates the execution of the transducer along plays 
in the arena, enabling the computation of information sets. Dealing with information sets enables us 
to perform K-modalities elimination, yielding a reduction of the initial problem to solving some LTL 
game. The procedure is however non-elementary since it requires one powerset construction per nesting 
of R-modalities. The proof for the matching lower bounds is a direct reduction from the word problem 
for exp[«]-space bounded alternating Turing Machines, which is (n + 1)-Exptime complete. Due to lack 
of space, it is omitted here. 

Corollary 4 The fully-uniform strategy problem is non- elementary complete. 

Regarding the synthesis problem, the procedure of ll25l for solving the terminal LTL game in the 
decision procedure of Theorem |3]is an effective construction of a winning strategy when it exists. This 
strategy provides a fully-uniform strategy of the initial game, by means of a transducer mapping plays of 
the initial game to plays in the terminal game. This transducer itself is straightforwardly built from the 
arena of the last game itself. 

6 Discussion 

We are currently working on sufficient conditions on the binary relation between plays to render the 
fully-uniform strategy synthesis problem elementary. It appears that being an equivalence relation is not 
enough, but if moreover the relation verifies a weak form of no learning property (see [ 14 1), the problem 
seems to be elementary. Concerning the strictly-uniform strategy problem, we conjecture undecidability 
in general, but we are investigating interesting subclasses of rational relations that make the problem 
decidable. 

It would then be interesting to extend the language to the case of n modalities Ri with n relations 
Also, the difference between the fully-uniform semantics and the strictly-uniform one could be at the 
level of modalities instead of the decision problems level. In Section 14.11 we have seen that uniformity 
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properties can represent uniformity constraints on the set of elegible strategies, and in Section 14.21 we 
have seen how they can represent epistemic winning conditions. However, while some properties require 
strict uniformity, others require full uniformity. Allowing to use both kinds of modalities in a formula 
would enable, for example, to express that a strategy must both be winning for some condition on the 
opponent's knowledge (with a fully-uniform modality, see Section |4~2"1 ). and to be observation based for 
the player considered (with a strictly-uniform modality). A formula of the following kind could be used 
for a variant of games with opacity condition where Alice would also have imperfect information (note 
that the arenas should be modified, and we assume that p2 would mark positions where Alice has to 
choose an action): 

cp:= G(p 2 ^ V<?0/> a ) A G<> PS 

aeAct 

S v ' V v ' 

Observation-based constraint Winning condition 
In a next step, we would like to consider how our framework adapts if we take as base language the 
one of Alternating-time Temporal Logic [2] instead of LTL, so as to obtain an Alternating-time Temporal 
Epistemic Logic-like language. It would enable us to express the existence of uniform strategies directly 
in the logic, and not only at the level of decision problems as it is the case for now. This step will 
require to pass from the two-player turn-based arenas considered so far to multiplayer concurrent game 
structures, that are ATL models, but the definitions should adapt without difficulties. However we should 
be cautious in generalizing these notions as undecidability will easily be attained. 
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